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Abstract. - We report the correlation analysis of various redshift surveys which shows that the 
available data are consistent with each other and manifest fractal correlations (with dimension 
D ~ 2) up to the present observational limits (ss 150 h _1 Mpc) without any tendency towards 
homogenization. This result points to a new interpretation of the number counts that represents 
the main subject of this letter. We show that an analysis of the small scale fluctuations allows 
us to reconcile the correlation analysis and the number counts in a new perspective which has 
a number of important implications. 


Ideally the study of the correlation analysis of galaxy distribution requires the knowledge 
of the position of all galaxies in space § 0 - In practice, the observation of angular positions 
plus the redshift provides a redshift catalogue in which galaxies are located in the three 
dimensional space, but such a catalogue is affected by a luminosity selection effect related 
to the observational point. In order to avoid this effect, one can define a maximum depth 
and include in the sample only those galaxies that would be visible from any point of this 
volume. This procedure defines a volume limited (VL) sample, whose statistical properties are 
unaffected by observational biases 0 0 . 

We discuss here the determination of the space density in various redshift and angular sur¬ 
veys. The underlying assumption used is that the space p{r ) and luminosity 4>{L) distributions 
are independent || . In such a way the number of galaxies for unit luminosity and unit volume 
can be written as v(L,r)d 3 rdL = p(r)d 3 r<p(L)dL. Although this assumption is not strictly 
valid in view of the correlation between galaxy positions and (absolute) luminosities, for the 
purpose of the present discussion this approximation is rather good Q. 

We start recalling the concept of correlation. If the presence of an object at the point 
tt influences the probability of finding another object at r%, these two points are correlated. 
Therefore there is a correlation at r if, on average G(r) = (n(O)n(r)) 7 ^ (n) , where we average 
over all occupied points chosen as origin. On the other hand, there is no correlation at r if 
G(r) « (n) 2 . The length scale Ao, which separates correlated regimes from uncorrelated ones, 
is the homogeneity scale. 
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In the analysis, it is useful to use || T(r) = G(r)/ < n > where < n > is the average 
density of the sample analyzed. The reason is that T(r) has an amplitude independent from 
the sample size, differently from G(r), and it is suitable for the comparison between different 
samples. 

T(r) can be computed by the following expression: 
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where D is the fractal dimension and B is the lower cut-off (see below). T(r) is the average 
density at distance r from an occupied point at fj and it is called the conditional average density 
[0. If the distribution is fractal up to a certain distance Ao, and then it becomes homogeneous, 
T(r) is a power law function of r up to Ao, and then it flattens to a constant value. Hence by 
studying the behavior of T(r) it is possible to detect the eventual scale-invariant properties of 
the sample. Instead the information given by the standard correlation function £(r) i § is 
biased by the a priori (untested) assumption of homogeneity j|. 

Given a certain sample with solid angle Q and depth R s , it is important to define which is 
the maximum distance up to which it is statistically meaningful to compute the correlation 
function. As discussed in [||, the conditional density T(r) has to be computed in spherical 
shells; in this way we do not make any assumption in the treatment of the boundaries 
conditions. For this reason, the maximum distance up to which we extend our analysis is 
the order of the radius Reff of the largest sphere fully contained in the sample volume. In 
such a way we do not consider in the statistics the points for which a sphere of radius r is 
not fully included within the sample boundaries. For this reason we have a smaller number of 
points and we stop our analysis at a shorter depth than other authors ones. 

When one evaluates the correlation function (or the power spectrum ||) beyond R e ff , 
then one makes explicit assumptions on what lies beyond the sample’s boundary. In fact, 
even in absence of corrections for selection effects, one is forced to consider incomplete shells 
calculating T(r) for r > R e ff, thereby implicitly assuming that what one does not find in the 
part of the shell not included in the sample is equal to what is inside. 

We show in Fig.l the determination of the conditional density in VL with the same cut 
in absolute magnitude, in different surveys (see ||lo| for a review on the subject). The match 
of the amplitudes and exponents is quite good. The main result is that galaxy distribution 
shows fractal correlations with dimension D ss 2 up to the limiting length R e ff , which is 
different for the various samples (ranging from 20 h~ 1 Mpc to about 150h~ 1 Mpc ) || [ fiof . 
There have been attempts to push R e ff to larger values by using various weighting schemes 
for the treatment of boundary conditions 0. These methods however, unavoidably introduce 
artificial homogenization effects and therefore should be avoided ||. A different way to get 
information for larger scales is presented in the following. 

Historically 0, the oldest type of data about galaxy distribution is given by the relation 
between the number of observed galaxies N(> /) and their apparent brightness /. It is easy to 
show that 0 N(> f) ~ where D is the fractal dimension of the galaxy distribution. In 
terms of the apparent magnitude / ~ xo _0 4m (note that bright galaxies correspond to small 
m), the previous relation becomes logIV(< m) ~ am with a = D/5 |0). In Fig.2 we have 
collected all the recent observations of N(< to) versus to [[Tlj. One can see that at small scales 
(small to) the exponent is a ~ 0.6, while at larger scales (large to) it changes into a ss 0.4. 
The usual interpretation 0 is that a ss 0.6 corresponds to D « 3 consistent with homogeneity, 
while a ~ 0.4 is the result of large scales galaxy evolution and space time expansion effects. 
On the basis of the previous discussion of the VL samples, we can see that this interpretation 
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is untenable. In fact, there are very clear evidences that, at least up to 150 h 1 Mpc there 
are fractal correlations 0 0 - so one would eventually expect the opposite behavior: a ~ 0.4 
(fractal with D ss 2) for small to, and a ss 0.6 for large m. An additional argument addressed 
in favor of homogeneity, at rather small scales, is the rescaling of angular correlations |^]. This 
again seems to be in contradiction with the properties observed in the VL correlation analysis. 

We show that this contradictory situation arises from the fact that, given the limited amount 
of statistical information corresponding to the various methods of analysis, only some of them 
can be considered as statistically valid, while others are strongly affected by finite size and 
other spurious fluctuations that may be confused with real homogenization |Hj]. We focus 
now on the possibility of extending the sample effective depth R e ff- In order to discuss this 
question, it is important to analyze the properties of the small scale fluctuations. To this aim, 
we introduce the conditional density in the volume V (r) as observed from the origin , defined 
as 


n{r ) = 


N(< r) 


3Bp_ r D-3 
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In principle Eq.|| should refer to all the galaxies present in the volume V(R). If instead we 
have a VL sample, we will see only a fraction Nvl{R ) = p ■ N(< R) (where p < 1) of the 
total number N(< R) of galaxies in V{R). If <f{L)dL is the fraction of galaxies whose absolute 
luminosity ( L ) is between L and L + dL fl^ ], p is given: 

JZ L <KL)dL 


0 < p = 
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The function (j){L) has been extensively measured |fL3| and it is a power law extending from a 
minimal value L m i n to a maximum value L* defined by an exponential cut-off. In Eq.|]Lyi, 
is the minimal absolute luminosity that characterizes the VL sample and L m i n is the fainter 
absolute luminosity (or magnitude M min ) surveyed in the catalog (usually M min ~ —11). 
Computing n(r), we expect (Fig.3 - insert panel) not to see any galaxy up to a certain distance 
l v . For a Poisson distribution this distance is of order of the mean average distance between 
neighboring galaxies, £ v ~ {V/N) 3. Of course, such a quantity is not intrinsic for a fractal 
distribution because it depends on the sample volume, while the meaningful measure is the 
average minimum distance between neighboring galaxies £ m i n , that is related to the lower 
cut-off of the distribution. For distances somewhat larger than £ m i n we expect therefore a 
raise of the conditional density because we are beginning to count some galaxies and n(r) is 
affected by the fluctuations due to the low statistics. It is therefore important to be able to 
estimate and control the minimal statistical length A, which separates the fluctuations due to 
the low statistics from the genuine behavior of the distribution. A simple argument for the 
determination on the length A is the folliwng (see also |[l]). At small scale, where there is 
a small number of galaxies, there is an additional term, due to shot noise, superimposed to 
the power law behavior of n(r), that destroys the genuine correlations of the system. Such a 
fluctuating term can be erased out by making an average over all the points in the survey. On 
the contrary, in the observation from the origin, only when the number of galaxies is larger 
than, say, ~ 30, then the shot noise term can be not important. This condition gives (from 
Eq.|) 
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for a typical VL sample with Mvl ~ M*, where B corresponds to the amplitude of the 
conditional density of all galaxies JTT| [jloj. This can be estimated from the amplitude of 
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r(r) in a VL sample divided by the correspondent p as defined in Eq.|J. We find (for typical 
catalogues) B th 10 4- 15 (h~ 1 Mpc)~ D ||TT|| . 

In Fig.3 we report the radial density estimated from the origin for different VL samples 
derived from the PP catalogue. The finite size transient behavior is evident and the correct 
scaling is reached for lengths larger than A « 50 h~ 1 Mpc (fi = 0.9sr), the same for all the 
VL samples. In Fig.2 we can see that this behavior is in perfect agreement with the full 
correlation analysis corresponding to smaller scales. In Table I we report the values of A for 
the various catalogues. We have checked the validity of these values for the available catalogues 
(CfAl, PP, SSRS1, LEDA, ESP), as well as for artificial simulations as a test. Indeed in all 
these catalogues one observes a well defined power law for R > A, corresponding to a fractal 
dimension D ss 2, up to the catalogue depth [O. It is remarkable to note that for the ESP 
catalogue this depth is ~ 800 4- 900 h~ 1 Mpc |10[ . 

The introduction of the minimal statistical length A has a very important effect on the 
number counts N{< m) and on the analysis of angular samples. For the number counts it 
is clear that if the majority of the galaxies in the survey are located at distances smaller 
than A this will not give us reliable statistical information. In particular, the region up to 
A is characterized by a strongly fluctuating regime, followed by a decay just after A (Fig.3 
insert panel). For integral quantities as the number counts, such a behavior can be roughly 
approximated by a constant conditional density over some range of scales. This will lead to 
an apparent exponent a ~ 0.6 as if the distribution would be really homogeneous. If instead 
the majority of galaxies lie in the region beyond A the number counts will correspond to the 
real statistical properties. 

To be more quantitative, suppose to have a certain survey characterized by a solid angle 
Q and we ask the following question: up to which apparent magnitude limit mum do we 
have to push our observations to obtain that the majority of the galaxies lie in the statistically 
significant region (r > A) ? Beyond this value of mu m we should recover the genuine properties 
of the sample because, as we have enough statistics, the finite size effects self-average out. From 
the previous condition for each solid angle fl we can find an apparent magnitude limit TO;,; m . 

To this aim, we can require that, in a ML sample, the peak of the selection function, which 
occurs at distance r pea k, satisfies the condition r pea k > A . The peak of the survey selection 

m I 

function occurs for M* s=s —19 and then we have r pea k ~ 10 5 . From the previous relation 

and Eq.[| we have that 

miim = M* - 5 log(A) + 25 « 14 - log(H) . (5) 

It follows that for m > 19 the statistically significant region is reached for almost any reasonable 
value of the survey solid angle. This implies that in deep surveys, if we have enough statistics, 
we readily find the right behavior (a = D/ 5), while it does not happens in a self-averaging 
way for the nearby samples. Hence the exponent a ss 0.4 found in the deep surveys (m > 19) 
is a genuine feature of galaxy distribution , and corresponds to real correlation properties. In 
the nearby surveys m < 17 we do not find the scaling region in the ML sample for almost any 
reasonable value of the solid angle. Correspondingly the value of the exponent is subject to 
the finite size effects, and to recover the real statistical properties of the distribution one has 
to perform an average. 

We can now go back to Fig.2 and give to it a completely new interpretation. At relatively 
small scales we observe a ~ 0.6 just because of finite size effects and not because of real 
homogeneity. This resolves the apparent contradiction between the number counts and the 
correlation in VL samples that show fractal behavior up to ~ 200 h~ 1 Mpc. For m > 19 we are 
instead sampling a distribution in which the majority of galaxies are at distances larger than 
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Table I. - In this table we summarize the characteristic properties of several redshift catalogues 
and their volume limited samples, fl is the solid angle, Rvl the depth of the VL sample and 
Nvl the total number of galaxies. The minimal statistical length A gives us the scale above 
which the analysis of the conditional density from the origin is statistically meaningful. 


Survey 

fl(sr) 

A {h x Mpc) 

RvL{h l Mpc) 

N V l 

CfAl 

1.8 

15 

40 

442 

CfA2 (North) 

1.3 

20 

101 

1031 

PP 

0.9 

50 

60 

990 

SSRS1 

1.75 

15 

60 

345 

LEDA(m=16) 

2 7r 

10 

80 

4550 

IRAS1.2Jy 

4 7r 

10 

60 

876 

ESP 

0.006 

300 




A and indeed a ~ 0.4, corresponding to D « 2, in full agreement with the correlation analysis. 
Note that the change of slope at m ~ 19 depends only weakly on the solid angle of the survey. 
In order to check that the exponent a ~ 0.4 is the real one we have made various tests on PP 
where also one observes a ss 0.6 at small values of m, but we know that the sample has fractal 
correlations from the complete space analysis [|ll] . An average of the number counts from all 
points leads instead to the correct exponent a ~ 0.4 because for average quantities the effective 
value of A becomes actually appreciably smaller (see |ll| for more details). Our conclusion 
is therefore that there is not any change of slope at m ~ 19, and we see the same exponent 
in the range 12 < m < 18, where the combined effects K-corrections, galaxy evolution and 
modification of the Euclidean geometry are certainly negligible, and in the range 19 < m < 28. 

Figures and Tables. - 
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Fig. 1. The spatial density T(r) computed in some VL samples of CfAl, PP, LEDA, APM, 
ESP, LCRS, SSRS1, IRAS and ESP and normalized to the corresponding factor, as explained 
in the text. 
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Fig. 2. - Galaxy number counts as a function of the apparent magnitude (in) in the visible 
B-band. a = D/5 » 0.6 
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Fig. 3. - Conditional density computed form one point in various VL samples of PP. The 
behavior of the average conditional density (Fig.2 Top panel) can be extended to larger scales 
by the conditional density form the vertex only for r > A where it becomes statistically 
meaningful. In the insert panel: Schematic behavior of the conditional density computed form 
a single point (the origin). 




